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Abstract. A rigorous lower bound is obtained for the average resolution of any 
estimate of a shift parameter, such as an optical phase shift or a spatial translation. 
The bound has the asymptotic form kj/(2\G\) where G is the generator of the 
shift (with an arbitrary discrete or continuous spectrum), and hence establishes 
a universally applicable bound of the same form as the usual Heisenberg limit. 
^ The scaling constant kj depends on prior information about the shift parameter. 

O | For example, in phase sensing regimes, where the phase shift is confined to some 

small interval of length L, the relative resolution <5$/L has the strict lower bound 
(27re 3 )- 1 / 2 /(2m|Gi| + 1), where m is the number of probes, each with generator 
Gi, and entangling joint measurements are permitted. Generalisations using other 
resource measures and including noise are briefly discussed. The results rely on 
the derivation of general cntropic uncertainty relations for continuous obscrvablcs, 
which are of interest in their own right. 



1. Introduction 

In many measurement scenarios, an environmental variable acts to translate or 
shift a property such as the optical phase or position of a probe state. Accurate 
estimation of the shift parameter allows a correspondingly accurate measurement of 
the environmental variable. For example, interfcrometric measurements of quantities 
such as temperature, strain and gravitational wave amplitudes rely on estimation of 
an optical phase shift. An important aim of quantum metrology is to determine the 
fundamental bounds on the resolution of such estimates [TJ [2] . 

One useful tool in this respect is the quantum Cramer- Rao inequality [Hill], which 
can be used to obtain bounds on the resolution of a shift parameter in terms of the 
variance of the operator that generates the shift [U [31 El [SJ [5] . For example, the root 
mean square error of any unbiased estimate A of a shift parameter A. for any fixed 
value X — xq, satisfies [5] [6] 

6X X0 = ((A - x ) 2 ) 1/2 > 1/(2^AG), (1) 

where G is the shift generator, AG is the uncertainty of G for the probe state, and m 
is the number of (independently measured) copies of the probe state. 

One can also use the Cramer-Rao inequality to derive resolution bounds in term 
of other quantities, such as the maximum and minimum eigenvalues of G when they 
exist [5] , or the mixedness of the probe state [HJ . While such bounds are universally 
valid, scientists often wish to obtain bounds in terms of other resources, such as the 
average energy or mean photon number of the probe state. But, for example, if G = N 
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is the photon number of a single mode field, then A7V can be arbitrarily large or small 
relative to the average photon number (N) . A simple example which proves this point 
is a probe state with number distribution p = 1 — w, p n = w for some n^O, and 
vanishing otherwise. Then (N) / AN = [(1 — w)/w] 1 ^ 2 ranges over (0,oo) as w ranges 
over (0,1)0 Thus, the lower bound (fTJ) does not limit phase resolution in terms of 
average photon number. This illustrates that the quantum Cramer-Rao inequality 
cannot always be used to derive bounds in terms of the resources of interest required 
to achieve a given resolution, in particular when the eigenvalue range is unbounded. 

An alternate tool for bounding resolution is the Heisenberg limit [TU]. While 
less developed than the quantum Cramer-Rao inequality, it may be heuristically 
characterised as an asymptotic lower bound on measurement resolution that scales 
inversely with the number of resources available and that is achievable, up to a constant 
factor [TTJ [TH H31 HH HSl 02] ■ These references use various ways to quantify 
mcausurcmcnt resolution and number of resources, but at this point it suffices to 
quote the earliest result (obtained numerically) : the root mean square error of a 
canonical phase estimate on a single mode field, for any applied phase shift </>o, can 
asymptotically scale no better than 

6^ > 1.38/ (N). 

This scaling with the average photon number (N), rather than with AN as in (TTJ), has 
the advantage of providing a necessary condition on the energy resources required for 
a given phase resolution. 

Recently, progress has been made in generalising the Heisenberg limit to obtain 
non-asymptotic resolution bounds for arbitrary estimates of shift parameters. These 
may be called Heisenberg-style bounds. First, for a shift generator G with a discrete 
spectrum and finite lowest eigenvalue g m m, Giovannetti et al have bounded the root 
mean square error of any estimate, when averaged over two fixed values of the shift 
parameter, by 0.076/ (G — g m in) — providing that the fixed values and errors satisfy a 
particular constraint |18j . Second, when G is further restricted to integer eigenvalues, 
Hall et al have obtained the constraint-free bound of 0.559/(G — g m - m + 1), for the 
root mean square deviation of any estimate, when uniformly averaged over all values 
of the shift parameter [19] . 

Despite this progress, none of the above results characterise the overall 
performance of a given estimate in the important case that prior information is 
available about the value of the shift parameter. For example, in a phase sensing 
regime |20j , such as gravitational wave detection, the value of an applied phase shift is a 
priori known to lie within some small interval about zero. Hence, only the performance 
of the estimate over the interval is of interest — it is irrelevant how well or how badly 
the estimate may perform outside this interval. Indeed, in a stimulating paper, Rivas 
and Luis have recently proposed a phase estimation scheme that improves on the 
scalings of the above Heisenberg bounds for small phase shifts [5T] . 

It is therefore of great interest to determine fundamental bounds when prior 
information is available to be exploited. For example, the quantum van Trees 
inequality generalises the quantum Cramer-Rao inequality, to bound the mean square 
error averaged over the prior probability density q(x) of the shift parameter [BT I22j. 
leading to AG in (TTJ) being correspondingly replaced by [VarG + Fq] 1 / 2 , where 
F q = J q(x) (d\nq(x)/dx) 2 is the Fisher information of q(x). Note however that, 

f Moreover, if one chooses w = n — 3 / 2 , then the value of AN diverges with increasing n, while (N) 
becomes arbitrarily small. 
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for estimates confined to some bounded interval with q(x) uniform over this interval, 
F q vanishes and no improvement is obtained over the lower bound in ([T]). 

Tsang has very recently obtained the first results exploiting prior information 
in the context of Hcisenberg-style bounds [53]. For example, he has shown that, 
for discrete shift generators and a prior density q(x) uniform over any interval of 
length L, the root mean square error of any estimate is bounded by a function which 
asymptotically approaches 0.154/(G — <?mm), under a constraint that L is sufficiently 
large (see also Ref. [24]). 

The central result of the present paper is a general Heisenberg-style lower bound 
that takes arbitrary prior information into account. It is constraint-free, does not 
require the generator to be discrete, nor to have a finite lowest eigenvalue. In 
particular, we show that the average mean square deviation, (SX) 2 = ((X — X) 2 ), of 
any estimate X of some shift parameter X, over any prior distribution of X , satisfies 

** S c + 2A-. ( '| G - 9 |) ■ < 2 > 

Here k\ depends on the prior information available about the shift parameter, G is the 
shift generator, and g is an arbitrary eigenvalue of G. For continuous spectra, c = 
and A — 1, while for discrete spectra c = 1 and A is the minimal spectral gap of G. 
Further, if the spectrum of G has a smallest value, g min , then the factor of 2 in the 
denominator can be removed for the choice g = <? m i n - 

The scaling constant fcj in ([2]) depends on the prior distribution of the shift 
parameter, and can in principle be arbitrarily small for a sufficiently narrow prior 
distribution. In particular, for any generator with a discrete spectrum, if the shift 
parameter X is known a priori to be randomly distributed over an interval of length 
L, it will be shown that kj > L(27re 3 ) -1 / 2 . Thus there is a fundamental 'relative 
resolution' bound for how well the shift can be resolved, relative to the size of the 
interval to which it is confined. Moreover, with the additional assumption that the 
state is a number m of identical copies of some probe state, each with discrete generator 
G, it will be shown that inequality ([2]) implies that 

SX (2^e 3 )-V 2 

L ~ I + 2mA- 1 (|G-.g|)' 1 ' 

with a similar bound conjectured when G has a continuous spectrum. Note that 
this inequality allows for arbitrary measurements over the m probes, even entangling 
joint measurements, unlike bounds obtained from the Cramer-Rao inequality, which 
assumes a fixed measurement. The scaling of ^ with mT 1 contrasts to the mT 1 ! 2 
scaling in (Q}. While neither bound is necessarily achievable for a given probe state 
and a given m, the scaling as mT 1 ! 2 in ([I]) is as expected from elementary statistics. 
Hence we do not expect that our bounds would be tight for a fixed probe state and 
large m. 

Similarly to if the spectrum of G has a smallest value g m i n then the factor 
of 2 in the denominator of © can be removed for g = g m i n . For example, for 
phase shifts generated by the photon number operator N of a single mode field, the 
relative resolution of any estimate over an interval of length L is bounded below by 
(27re 3 )~ 1//2 /(I + mN), generalising the result of Hall et al [19], which was limited to 
the case of a completely random phase shift, with L = 2n. 

The results of the paper are obtained via the derivation of suitable entropic 
uncertainty relations for the shift generator G and the error X — X (section 2 and 
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appendices). Examples are given for estimates of optical phase, time, and spatial 
displacements, including generalisations to alternative resource measures such as the 
support of the energy distribution of the probe state (sections 3 and 4). A further 
generalisation of inequalities © and §S§ is given which quantifies the effects of noise 
(section 5), followed by a brief discussion (section 6). 

2. Estimation bounds from entropic uncertainty relations 

2.1. General estimation schemes 

Consider a general shift parameter estimation scheme, in which a probe state po 
undergoes a shift generated by some operator G, to the state p x = e~ lGx poe lGx . A 
measurement of observable M on the probe then outputs some estimated value x of 
the actual shift value x, and is described by some positive-operator-valued measure 
(POVM) M = {M x } [21 H]. A standard notation will be used in which random 
variables and operators appear in upper case, and (eigen)values of these quantities 
appear in lower case. Thus, the shift parameter will be denoted by X, and its estimate 
by X. 

If the prior probability density of the shift parameter is denoted by q(x), then 
the probability density of the error or deviation, Y = X — X, of the estimated value 
from the true value, is given by 

Py(v) = J dxq(x)px(x+y\x) = J dxq(x)tt[p x M x+y ] = tr[poM„](4) 

where the POVM M = {M y } is defined by 

M y := J dxq(x)e lGx M x+y e- tGx . (5) 

A 'good' estimate of the shift parameter will be one for which the error is small 
on average, i.e., for which Py(v) is highly peaked about y = 0. This may be quantified 
by the mean square deviation 

(SX) 2 := ((X X) 2 ) = (Y 2 ) = J dyy 2 p Y (y). (6) 

Note that for periodic shift parameters, such as phase, the integration may be taken 
over an interval centred on y = [19j . The quantity SX Xo in ([T]) corresponds to the 
case q(x) = 6(x — xq). 

Equations (j4}-([6]) generalise the case of phase estimates considered in Hall et al 
[19] . where the prior density was restricted to be uniform, i.e., q(x) = l/2it. Under 
this restriction the POVM M is covariant, with M x+y = e~ lGy M x e lGy , allowing a 
connection to be made between SX and G via an entropic uncertainty relation for 
canonically conjugate number and phase operators [19J . However, that method fails 
whenever q(x) is non-uniform, as M. is no longer covariant. Note also that q(x) is 
necessarily non-uniform for non-periodic shift parameters. 

It turns out that the key to generalising the approach of Hall et al is the 
extension of existing entropic uncertainty relations for arbitrary discrete observables 
[25j [26j [27] , to the case of continuous POVMs, such as M. in equation ([5]) above. The 
necessary extensions are derived in appendix A. It will now be shown how these lead 
to Heisenberg-style lower bounds for 8X, as per inequalities and ([3]). Examples 
and generalisations are given in sections 3-5. 
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2.2. Exploiting entropic uncertainty relations 

Suppose that one has an entropic uncertainty relation for the observables G and M 
(the observable corresponding to the POVM A4) of the form 

H{G)+H(M) > In if j 

for some constant A/. Here H{A) denotes the Shannon entropy of the measurement 
distribution of observable A, for a probe in state po- Several such uncertainty relations 
are given in appendix A. In general, A/ depends on the prior information encoded in 
the prior density q(x), and its form is discussed in section 2.3 below. 

From equation (0| the statistics of M and Y arc identical, and thus the above 
uncertainty relation can be rewritten as 

H(X-X) = H{Y)>\nKi-H{G). (7) 

Furthermore, consider the variational quantity J = H(Y) + a(l) + f3(Y 2 ), where a 
and j3 are Lagrange multipliers fixing the normalisation of py(y) and the value of (Y 2 ) 
respectively. The variational equation 5 J/ Spy = leads directly (Chapter 12 of j2"8] ) 
to the upper bound H(Y) < | In [27re(y 2 )], saturated by the Gaussian distribution 
py(y) = (27r(F 2 )) -1 / 2 exp[— y 2 /2(y 2 )]. Combining this bound with equations (JU) and 
(0 yields the lower bound 

for the root mean square deviation of the estimate. 

Inequalities (|7|) and (j^J provide information-theoretic bounds on the performance 
of the estimate, in terms of the entropy of the shift generator for the probe state. 
This is already useful in contexts where entropy itself can be considered as a resource. 
Inequality © is also useful for determining alternative bounds on resolution, under 
various constraints on G, as will be discussed in sections 3 and 4. 

Finally, as shown in appendix B, if g is an arbitrary eigenvalue of G, then the 
entropy of G is bounded above by 

H{G) < l+ln[c + 2A- 1 (|G-. 9 |)] , (9) 

where the factor of 2 can be dropped if G has a minimum eigenvalue g mm and g = g min . 
For discrete generators, A = min g ^ g » \g' — g"\ is the minimum spectral gap of G, and 
c = 1. For continuous generators, A = 1 and c = 0. Substitution of ([9]) into ([8} 
immediately yields inequality ([2]), with 

fc 7 = (2Tre 3 )- 1/2 K I . (10) 

Before discussing specific examples and generalisations of the generic Heisenberg 
bound ((2J, the dependence of the constant Ki on the prior information encoded in 
q{x) will be examined, yielding a derivation of the relative resolution bound ([3]) for 
discrete generators, and an analogous conjectured bound for continuous generators. 

2.3. Dependence on prior information 

2.3.1. Discrete generators: For a shift generator G with a discrete spectrum, a 
suitable scaling constant Kj follows from inequality (|A.2[) of appendix A as 

A7 1 = SUp (^TgMyTg^) = SUp (tP g \My\tPg) (i>\Tg\i>) 

9,v,i> g,y,i> 
g,y 



Heisenberg- style bounds for arbitrary estimates of shift parameters 



6 



where T g denotes the projector on to the unit eigenspace of eigenvalue g, \ip g ) := 
r s |^ , )/(V , |r s |V') 1 ^ 2 j an d \g) denotes any normalised eigenstate of G. The last equality 
holds since (i) \ip g ) is always equal to some \g) by construction, and (ii) the factor 
{il)\Y g\ip) is maximised when = \tjj g ). Hence, using equation ([5]) for M y , one has 

A 7 . = sup W) to |M, + „| 9 ) = sup U q( z) P ,(* + v), (ID 
g,y J g,v J 

where p g {x) denotes the measurement distribution when the probe state is replaced 

by eigenstate \g) of G. 

Thus, the value of Kj is determined by the maximum possible value of the 

convolution of the prior probability density q(x) with the measurement distributions 

{p g }- Moreover, noting that 



dxq(x)p g (x + y) < / dxp g (x + y) 

where q max denotes the maximum value of the prior probability density q(x), one has 

Ki > l/g max . (12) 

This constraint on Kj leads to the universal relative resolution bound ([3]) for 
discrete generators. In particular, an estimate based on (a possibly entangling joint 
measurement on) m copies of a probe state corresponds to replacing po by <S) m Qo 
and G by Gt = G\ + Gi + . . . + G, n , where Gj refers to G for the j-th copy. Note 
that mg is an eigenvalue of Gt, |Gi + . . . + G m — mg\ < \G\ — g\ + . . . + |G m — g\, 
and At = A. Hence, choosing q(x) to be uniform over an interval of length L and 
vanishing elsewhere, g max = 1/L and inequality (|3|) follows from relations ([2]), (fTUf 
and ([12]). 

The lower bound in ([T2|l can be approached, in principle, if the measurement 
distribution p g (x) is sufficiently peaked around some value x g , for some eigenstate \g) 
of G. In particular, this allows y to be chosen in (fTT|) such that p g (x + y) is peaked 
around the maximum value of q(x). Thus, the more the estimate is concentrated 
around some value, given a system prepared in some eigenstate \g) of G, the closer 
the constant K\ will be to l/g ma x, allowing the possibility of of approaching the lower 
bound in equation ([3]). This possibility is further discussed in section 3.1. 

Note finally that the bound in ([12")) is only useful when the prior probability 
density q{x) is not infinitely peaked. Although q max will be finite for any physical prior 
distribution, it is of interest to find stronger bounds not subject to this limitation. 
For example, note that any probe state corresponding to an eigenstate \g) of G 
is invariant under shifts generated by G, implying no corresponding estimate can 
improve on prior knowledge about the shift parameter. It is therefore natural to 
define an estimate to be 'ignorance respecting' if the measurement distribution for any 
eigenstate of G, p g (x), is not any better concentrated than the prior probability density 
q{x), in the standard sense that p g majorises q |29j . This implies in particular that 
J dx F(p g (x)) < J dxF(q(x)) for any continuous convex function F [29] . Choosing 
F(z) = z 2 , and writing C r :~ Jdxr(x) 2 for probability density r, it follows via 
equation and the Schwarz inequality that 

KJ 1 = sup / dxq(x)p g (x + y) < sup[G g G Pg ] 1/2 < C q 
g,y J g 

for ignorance- respecting estimates, i.e., 



Kj > 



dx q{xY 



-l 



(13) 
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This is stronger than the lower bound (|12[) . and can be nontrivial even when g max = oo. 

2.3.2. Continuous generators: Similar results hold for a shift generator G with a 
continuous spectrum. In particular, from equation (| A.6|) of appendix A, identifying G 
with Y, a suitable scaling constant Kj follows as 

Kj 1 = 8wp{g\M y \g) = sup / dxq(x) (g\M x+y \g). (14) 
g-y g,y J 

Here \g) ranges over all (typically degenerate) unit eigenkets appearing in any spectral 
decomposition of T g (i.e., \g) = \g,d) for some d in an orthogonal expansion 
T g = J2d \9id)(g,d\, where d is an arbitrary degeneracy index). 

It has not been possible at this time to prove a general relative resolution bound 
for continuous generators, analogous to ([3]). In particular, for continuous generators 
the ket \g) is not normalisable, so that (g\M x+y \g) in ([14")) does not correspond to some 
measurement probability density p g (x). However, it is conjectured that 



SX J^/2e . . , 

X " 2L(|G- g |) (conjecture) (15) 

for such generators, if the prior distribution is uniform over an interval of length L. 
Here, as always, the factor of 2 in the denominator can be dropped for a bounded 
spectrum for the choice g = g min . The support for this conjecture arises from a 
correspondence between the cases of no prior information and covariant estimates, as 
will now be detailed. 

First, if the POVM {M x } is covariant, then from equation ([5]) it follows that 
My = J dx q(x) My = M y , for any prior distribution q(x). Hence, covariant estimates 
cannot make use of any prior information. Conversely, any estimate that does makes 
use of prior information must be noncovariant. 

Second, for continuous generators, H(G) + H(M) > lnvre for any covariant 
POVM A4 from equation (|A.11[) , where the bound is saturated when M is canonically 
conjugate to G. Hence one may take Kj = ire for covariant estimates. Moreover, for 
m probe states, any covariant estimate of X satisfies the same entropic uncertainty 
relation with G replaced by Gt = G± + ...G m . Hence, using ([2]), (p~0|) . and 
\G\ + . . . + G m — mg\ < \G\ — g\ + ...+ \G m — g\, one has the rigorous bound 

d^cov > rr, (16) 

2m{\& - g\) 

for covariant estimates, where the expectation value is with respect to Qq. As usual, 
the denominator can be replaced by m(G — g m in) if G is bounded below. This 
bound includes covariant estimates based on entangling joint measurements; a class 
of covariant estimates based on independent measurements is also briefly discussed in 
appendix A. 

Finally, since inequality (|16[) corresponds to no use of prior information, it can 
be interpreted in a limiting sense as a resolution bound relative to a prior distribution 
q(x) which is uniform over the whole real line. This corresponds to L — > oo in (|15[) . 
where more generally the conjecture claims the same relative bound holds for prior 
distributions uniform over any finite interval of length L. 
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3. Examples: discrete shift generators 

3.1. Phase shift estimation 

For the case where the spectrum of the generator G is a subset of the integers one has 
e iG4> _ e iG(<p+2-n) ^ an( j h ence the corresponding shift parameter $ may be treated as a 
phase parameter, taking values on the unit circle. It follows from equations ([2]), (jTOJ) 
and that 

> > (2 TO 3)-V^ J (We^ 

" (2|G- 5 | + 1> " 27rg max (2|G- 5 | + l) ' 1 J 

where q mayi > 1/27T is the maximum value of the prior probability density g((/>) for 
Moreover, if it is known a priori that the phase shift is confined to an interval of 
length L, then q(x) = L _1 over the interval and vanishes elswhere, and hence from 
equation §3§ for m identical systems the relative resolution is bounded by 

5$ (2^3)-l/2 

L ~ 1 + 2m( |Gi — g\)' [ ' 

As previously, the term 2|G — g\ in the denominators of (|T7|) and (fTB"]) may 
be replaced by G — g m in when the spectrum is bounded below. Hence for a single 
mode field, with number operator TV > 0, one has 5& > (2tt / e 3 ) 1 / 2 / {N + 1) in 
the case of no prior information, with q((f>) = 1/2tt = q ma , x . However, strong 
numerical evidence has been given that the numerator (27r/e 3 ) 1 / 2 sa 0.559 for this 
case can be replaced by a best possible value of « 1.376, which is asymptotically 
achievable for large (N) on a suitable probe state via the canonical phase estimate 
M = (27T)- 1 J2 m ,n e^™"™^ \m) (n| [H|30]. Hence it is conjectured that the bounds 
(|17[) and (|18j) are not tight, and that a similar replacement can be made in the case 
of arbitrary prior information. 

To exploit any prior information, corresponding to a scaling constant Ki < 2ix in 
(|17|) . is nontrivial. Note that a covariant estimate is not suitable, as for any such 
estimate one has (g^l^g) — (2ir)~ 1 for all eigenstates \g) of G [U [12], yielding 
Kj = 2tt from equation pT|) . Indeed, equation (fTTj) implies that a necessary condition 
for exploiting prior information is that the estimate must return a nonuniform 
distribution over [0, 2ir] when some number eigenstate is input as a probe state. This 
is counterintuitive, since such eigenstates are invariant under phase shifts and hence 
cannot generate any useful phase information. However, it must be kept in mind that 
it is the actual probe state, po, rather than a notional probe state, that is relevant for 
actually estimating the phase. 

For example, in the recently proposed phase estimation scheme of Rivas and Luis, 
applicable to small phase shifts of a single mode field generated by the photon number 
operator N, the estimate is proportional to the result of a homodyne measurement 
on the probe state [21]. Hence, if a number eigenstate \n) was to be input as a 
probe state, such a homodyne measurement would generate the (clearly nonuniform) 
statistics proportional to the quadrature distribution |(x|n)| 2 . Thus, the Rivas and 
Luis scheme satisfies the above necessary condition for exploiting prior information. 
However, as noted by Rivas and Luis following their equation (29), while their estimate 
has an arbitrarily low root mean square error <5$o f° r a fixed phase shift value of zero, 
it can only further achieve an error 8<t,p w <S$o "C L, for each 4> in an interval of length 
L about = (corresponding to <g; 6(f) in the notation of [H]), if S<&o 1/iVr, 
where Nt = mtr[N qq\. Thus, unfortunately, noting that the averaged root mean 
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square error 5<5> over the interval is ~ 6<&q in this case, this scheme does not approach 
the lower bound in equation (|18|) above, nor even the numerically optimal bound 
(5$ > 1.376/ (N + 1) when prior information is not exploited [T9"] . 

The above derivation establishes ultimate bounds on phase resolution. In 
particular, for any phase estimation scheme it is impossible to have a scaling better 
than l/g m ax, or an asymptotic scaling better than l/(m\G\). However, it remains a 
challenge for further work to determine how closely the above phase estimation bounds 
can be be approached (up to some numerical factor) , via a suitable measurement and 
probe state. 

Finally, it is worth noting that while (G) is a transparent measure of resources 
when G is a photon number operator or similar, more generally the quantity (|G — g\) 
may not be. However, it is straightforward to generalise the method used in section 
2 to obtain bounds for resolution in terms of other quantities. For example, in the 
context of phase measurements, consider the shift generator G = \{Na — N E ), where 
Na and N E are the number operators for respective single-mode fields input to a Mach- 
Zchndcr interferometer |31j . If the total number of input photons is bounded by some 
fixed maximum value, i.e., Na + Nb < -/V max , then G can only take 27V max + l distinct 
values: 0, ±1/2, ±1, . . . , ±N max /2. Hence H{G) < ln(2iV max + 1), and equations (0) 
and (fT2|) yield the bounds 

se>^- 1/2K >, s 4 > < 2 ™>- 1/a , (19) 

" 2N max + 1 L ~ 2mN max + 1 

for the resolution of the corresponding shift parameter € [0,47r], where m and L 
have the meanings introduced earlier. 

3.2. Time estimation for discrete Hamiltonians 

Time estimates correspond to the case G = E/h~, where E is a Hamiltonian operator 
with lowest eigenvalue eo. Thus G generates the time shift operator e ~ lEt / h _ For the 
case of a discrete spectrum, the Heisenberg bound ([2]) (recalling the factor of 2 can be 
removed for g = eo), in combination with relations (p~0|) and (fT2j) . yields 



ki 1 (2 7 re 3 )- 1 /2 

- 1 + (E- e Q )/A E ~ W 1 + (E- e )/A E ' 1 1 

Here, A# denotes the smallest energy gap between distinct eigenvalues of E (thus, the 
bound is only useful for > 0), and (? m ax is the largest value of the prior density 
q(t). Further, if the time shift is a priori uniformly distributed over an interval of 
length r (which can be arbitrarily large if the system is not periodic), equation ([3]) 
yields the scaling bound 

St > (2ne 3 r^ ^ 0-089 
t " l + m(E-e )/A E ~ l + m(E - e )/A E ' 
where m is the number of identically prepared copies. 

When the energy differences ej — ek are incommensurate, the system will be 
almost periodic and q(t) must be defined over the whole real line. However, despite 
the nonexistence of a uniform prior distribution in this case, one can still define 
covariant time estimates and show, for example, that one cannot typically extract 
more than 1 bit of information from such an estimate [32] . It is therefore expected 
that a noncovariant estimate is required to exploit any prior information. 



Heisenberg- style bounds for arbitrary estimates of shift parameters 



10 



An alternative resource of interest for bounding time resolution, particularly if 
the spectral structure is complex, is the number of energy eigenstates accessible to the 
probe state. For example, if the probe state is a D-level system, then the entropy of 
its energy distribution must satisfy H(E) < In D, implying via the entropic bound (|5J| 
that ST > (27re)- 1 / 2 A" 7 /L>. Hence, via (fT2j) . one has the relative resolution bound 

ST (2ne)'^ 2 0.242 

— > — — ~ — 22) 

t mD mD 

for a time shift uniformly distributed over [0,r]. 

Note, however, that it is reasonable to expect that the actual time resolution of a 
discrete system will have a strong dependence on the detailed structure of the energy 
spectrum. Hence, the above bounds may be well below what is actually achievable. 

Finally, note that for a prior distribution uniform over an interval of length r, 
Tsang has very recently given the lower bound |23| 

ST 0.154ft I 0.329ft 

T-r( J B-e )V r(E-eo) (23) 

for the relative resolution, under the constraint that t(E — cq) > 0.690ft. For 
tAe < 0.154(27re 3 ) 1 / 2 ft ~ 1.73ft this is asymptotically stronger, as a function of 
(E — e) _1 , than the relative resolution bound (|2"Tj) (with m = 1), and is weaker 
otherwise. Hence it provides an improved relative resolution bound for the case of 
a sufficiently small energy gap, or a sufficiently small interval r still satisfying the 
constraint. 



4. Examples: continuous shift generators 

Time estimation for continuous Hamiltonians 

For Hamiltonians having a continuous spectrum, equation ([2j (recalling the factor of 
2 can be dropped for g = eo), together with equation (fT0|) . yields 

where Kj is given in (|14[) . Further, for any covariant time estimate one has 



X rf, > ffi h 0.76 ft/m 

dT cov > — w — 25) 

to (E) - e (E) - e 

from (JT^J) . Finally, if conjecture (fTS")) is correct, then (again dropping the factor of 2) 
the relative resolution bound 



ST Jn/2eh 

— > 7— (coniccturc) 

t ~ mr{E - e ) 

holds for a prior probability density q(t) uniform over an interval of length r. 

It would be of great interest to determine how closely the bound (f2~5j) can be 
approached, via a canonical time measurement on a suitable probe state. Note that 
the variance of the canonical time distribution does not exist if the energy distribution 
of the probe state has a nonzero groundstate component [4|. Hence, such probe states 
would require a different measure of time resolution — e.g., the ensemble length of the 
error distribution, cxp[H(T — T)], which may be bounded from below via inequalities 
© and ©. 
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4-2. Spatial displacement estimation 

As a final example, consider the case of estimation of the displacement of a quantum 
system in some direction, corresponding to the generator G = Pj h for the momentum 
in that direction. The general resolution bound 

r h (2Tre 3 )~ 1 / 2 K I , s 

(26) 

follows from ([2]) and f| 10[) . while for any covariant estimate one has 

git > h V^A^) (27) 

from (|16p. for the case of a single copy, m = 1. The conjecture (|T5|) implies a similar 
bound for the case of a prior distribution uniform over any finite interval, suggesting 
that measurement of the position observable Q conjugate to P, with POVM elements 
M q = \q){q\, is always optimal in this case. 

The above bounds are valid for all values of the reference momentum p. However, 
a variational calculation shows they are strongest when p is chosen to be the median 
value po of the momentum distribution p(k), i.e, when dkp(k) = 1/2. Note that 
the mean and median values are identical for the case of symmetric distributions. 

It is of interest to note that the covariant resolution bound (f2"T)l can be weaker or 
stronger than the Cramer- Rao related bound, H/(2AP), following from (JTJ) [2J [3 |4j [5] . 
For example, for probe states with a Gaussian momentum distribution, p(k) ~ 
exp[— fc 2 /(2cr 2 )], one finds AP = (n/2) 1 / 2 (\P\), implying that the bound ([T]) is stronger 
by a factor of 1.05. Conversely, for a probe state with an exponential momentum 
distribution, p(k) ~ exp[— f3\k\], one finds AP = V2(|-P|), implying that the bound 
((T|) is weaker by a factor of 1.08. 

5. Including the effects of noise 

The presence of noise is expected to decrease the accuracy of any estimate, and hence 
to increase the lower bounds in the previous sections. Consider, for example, the 
very simple case in which independent noise is added to the measurement outcome X. 
Denoting the noise variable by Z, the entropy power inequality |28j and equation ([7} 
imply 

e 2H(X+Z-X) > e 2H(X~X) + e 2H(Z) > ^2g-2ff(G) + & 2H (Z) _ 

Hence, as per the derivation of equation ([8]) , it follows that 

(Abound) 2 + {2^e)~ 1 e 2H ^A V2 , (28) 



^^noisy ^1 



where X no i sy := X + Z and ^A"b OU nd denotes any of the lower bounds of the previous 
sections. Noise thus increases the minimum possible resolution. 

A more physical approach is to consider processes that add noise directly to 
the probe state, and to use stronger entropic uncertainty relations which depend on 
the probe state. For example, for a rank-1 discrete generator G, inequality (IA.SI) 
of Appendix A effectively replaces Ki by Kie s ^°\ where S[po] is the von Neumann 
entropy of the probe state. Hence, replacing po by the noisy probe state n(po) for 
some completely positive map n, the generic lower bound ^ generalises to 

§X > — e s[n(p )] f 2 9l 

dA -l + 2A-i(|G-.g|)n e (29) 
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for such generators, where (-)n denotes an average with respect to n(p )- Similarly, 
the relative resolution bound ([3]) generalises to 

L ~ l + 2A-H\G-g\) u e ' m 

(one is limited to the case m = 1, since Gt = G\ + . . . + G m is not rank-1 for m > 1). 
For a rank-1 continuous generator G, the covariant bound (|16[) generalises to 

dXcov ~ 2(\G-g\)n e ' W 

via uncertainty relation (|A.10[) of Appendix A. As always, the factor of 2 in the above 
denominators may be removed for the choice g = g m - la . 

For example, let N be the photon number operator of a single mode field subject 
to Gaussian noise, where the noise is described by the completely positive map |33) 

A(p)= / d 2 a( 7 rn A )- 1 e- |Q|2/ ^ J D(a) ( o^(a) t , 



and D(a) = exp(aa^ — a*a) denotes the Glauber displacement operator. The 
parameter n\ characterises the average number of photons added to the field, i.e., 
(A^)a = (N) + n x , while the entropy of the field is bounded, both for pure and mixed 
states, by [31] 

S[A(p)} > ln(l + n x ) + n x ln(l + l/n x ) > ln(l + n x ). 
Combining this with the relative resolution bound ([30]) then yields a 'noisy' bound 
<5$ (2^e 3 )- 1 / 2 

— > (32) 

L ~ l + (N)/(n x + l) y ' 

for the relative resolution of any phase estimate, for a prior distribution uniform over 
some interval of width L. It is seen that the resolution becomes poor for sufficiently 
large noise. 



6. Discussion 



The results of the paper establish a rigorous, nonasymptotic and constraint-free lower 
bound for parameter estimation which is in the form of the Heisenberg limit and 
which takes prior information into account. The fundamental bound ^ implies that 
asymptotic scaling better than 1/(|G|) is impossible, while bound (J3J) for discrete 
generators further demonstrates that, for shifts randomly distributed over some 
interval, asymptotic scaling better than 1/m is impossible for the relative resolution, 
where m is the number of probe states (and entangling joint measurements are 
permitted). Bound (|16l) for continuous generators implies a similar 1/m limit is 
unavoidable for the case of covariant estimates. It has also been shown how the 
effects of noise may be quantified in section 5, including resolution bound (|32|) for 
phase estimates on a single mode field subjected to Gaussian noise. 

Examples have been given for estimates of phase shifts, time shifts and spatial 
displacement in sections 3 and 4. These sections also give examples of how the basic 
method of section 2 may be applied to obtain resolution bounds in terms of alternative 
resources, such as the total available photon number in equation (|T9)) and the energy 
support of the probe state in equation (f22|) . For the case of discrete generators with a 
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finite minimum eigenvalue, the corresponding relative resolution bound (|21[) may be 
stronger or weaker than the recent constrained bound ([23")) due to Tsang (section 3.2). 

The fundamental tool used to obtain the above resolution bounds is equation ([7J 
for the entropy of the error in the estimate, H(X — X). As noted briefly in section 2.2, 
the exponential of this entropy may be useful as an alternative possible measure of 
resolution [35 . Further, this measure has tighter corresponding bounds, as it avoids 
the use of the relation between SX and H(X—X) (which is only saturated for Gaussian 
distributions), required for obtaining equation pj. 

Another interesting measure of resolution to consider is the mutual information 
between the shift and its estimate, H(X : X) [28]. While mutual information is 
not dealt with directly in this paper, the relative resolution bounds ([3]), (p~8|) . (|19p. 
(|21|) . (|2"2"|) . (|3T)|) and (f3"2")l do allow an approximate upper bound to be derived for 
H(X : X), for discrete generators, whenever the prior distribution is uniform over 
some sufficiently large interval. In particular, the number of distributions of width SX 
that can be distinguished without error, over an interval of width L, is approximately 
L/6X. The corresponding mutual information, i.e., the corresponding number of bits 
that can be encoded by the shift parameter X and distinguished by the estimate X 
[28] , is therefore H(X : X) w log 2 [L/SX], i.e., the logarithm of the reciprocal of the 
relative resolution. Thus the above mentioned relative resolution bounds place an 
approximate upper bound on the mutual information. For example, for estimates of 
phase shifts uniform over an interval of length L, generated by the photon number of 
a single mode field subjected to Gaussian noise, one has the approximate upper bound 

F(4- : $) < log 2 [1 + (N)/(n x + 1)] + i log 2 [2^e 3 ] (33) 

for mutual information from equation (|32j). If the conjectured bound (|15p is correct, 
one may similarly obtain estimates of mutual information for continuous generators. 

As noted above, the resolution can scale no better than inversely with the number 
of probe states, m, even when entangling joint measurements are permitted. As noted 
in the introduction, this contrasts with the mT 1 / 2 scaling of the Cramer- Rao related 
bound (Q]). Note that for m = 1, the bounds of this paper can be numerically weaker 
or stronger than ([T]) (section 4.2). 

It has been seen in sections 2-4 that covariant estimates do not exploit any prior 
information that may be available. Hence it is only possible to approach the generic 
resolution bounds ([2]) and ([3]) via noncovariant estimates. Further, as noted in section 
3.1, a necessary condition for making use of prior information is that the measurement 
scheme must return a nonuniform distribution when some eigenstate of the generator 
is input as a probe state. While the recently proposed scheme of Luis and Rivas meets 
this condition, it does not approach the corresponding bound (jT5)) (section 3.1). It 
therefore remains an important challenge for future work to determine how closely the 
various lower bounds of this paper can be approached. 

It is also hoped that future work will settle the conjectures made regarding 
the relative resolution bound ([T5|) for continuous generators in section 2.3, the 
improvement in scaling factors for phase estimates in section 3.1, and the strong 
entropic uncertainty bound (|A.9[) in Appendix A. 

Finally, it is noted that the extensions of various entropic uncertainty relations 
to continuous observables, obtained in Appendix A, will find application beyond the 
realm of quantum metrology. 

Acknowledgment: This work was supported by the ARC Centre of Excellence 
CE110001027. 
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Appendix A. Entropic uncertainty relations involving continuous POVMs 

Appendix A.l. One continuous observable 

On a finite-dimensional Hilbert space, the entropies of two obscrvablcs A and 
B, corresponding to finitely-valued POVMs {Aj} and {Bk}, satisfy the entropic 
uncertainty relation [25J [36] 

H{A) +H{B) > -21nmax A\ l2 B]J 2 , (A.l) 

where H^H^ denotes the largest singular value of X, i.e., the square root of the largest 
eigenvalue of X^X. 

To extend this relation to the case where one of the observables is continuously 
valued, first consider some observable C taking continuous values in some compact 
set, with corresponding POVM {Cg}, and partition the range of 8 into a finite 
number of nonoverlapping bins {Pk} of equal size e. Define the discrete observable 
C p associated with the partition via the POVM {Cf } with C[ := f p d9C e . Then, 
for any probability density p(6) of C, there is a corresponding well-defined discrete 
probability distribution pk := J p d9p(9) (equal to the probablity of 9 £ Pk), and an 
associated piecewise-continuous probability density p{9) given by replacing p(9) by its 
average value over the bin Pk for 9 £ Pk, i.e., 



p(9) := (r l [ d9p{9) = e- l P k for 9 e P k 



Note that the probability of 9 £ Pk is identical for both p{9) and p{9), implying the 
latter converges in distribution to the former in the limit e — > 0. Note also that the 
entropy of C p , H(C P ) = — X^fc-Pfc m Pfc, can be rewritten using ^2 k P k = 1 as 

H(C P ) = - lne - J d9 p(9) \np(9). 

The entropic uncertainty relation (|A.1|) for observables A and C p gives 
H(A) + H(C P ) > - maxln(^ /2 Cf A) /2 \ip) 

= -maxln[(^|C fc p |^) (^\A, 

= -maxln[ep v ,,(0)(^|A J #)] 

= - lne - max In [p^,. (9) (ip\Aj 

where \ipj) :— A l ^ 2 \ip) / (^A^ip) 1 / 2 , and p^j{9) is defined analogously to p(9) above, 
with respect to the probability density p^ j {9) = (il>j\C$\ipj) (and thus converges in 
distribution to (9) in the limit e — > 0). Using the above expression for H(C P ) then 
yields, taking the limit e — > 0, 

H(A) + H(C) > - sup In [(^\C e \^) (^|V>] 

3fiA< 

= - supln(^|V /2 C e A- /2 |^) (A.2) 
= -2supln||A) /2 C e 1/2 || 00 , (A.3) 

3,6 

whenever H(C) exists, thus generalising (|A. 1|) . 
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Uncertainty relations (|A.2[) and (|A.3[) may be further extended to the case of 
infinite-dimensional Hilbert spaces, whenever the left hand side exists, by considering 
the limit of a series of projections of the observables onto finite Hilbert spaces. They 
similarly extend to the case of a countably infinite POVM {A,-}, whenever H(A) exists, 
by considering the limit of the sequence of finite POVMs {Ai, A2, . . . , Ad, 1 — Xw=i A?} 
as d — > 00. Finally, they also extend to the case of a non-compact range of C, whenever 
the left hand side exists, by representing the range as the limit of a series of compact 
sets S, and replacing {Ce} by a corresponding series of POVMs {Cg, 9 <G S}L){1 — Cs}, 
where Cs '■= J s d9Ce- Thus, (|A.2j) is valid for any discrete- valued observable A and 
continuously-valued observable C, whenever H(A) + H(C) exists, and similarly for 
(|A~3]) if C\' 2 is well defined. 

For example, for an angular momentum component J z and its corresponding 
conjugate angle <&, inequality in (|A.2[) reduces to the known result 

H(J 2 )+H($) > - sup In [|(^|j)| 2 |O10) 2 ] = In 2*h, 

first given by Bialynicki-Birula and Mycielski [37] , which is saturated by eigenstates 
of J z . 

Appendix A. 2. Two continuous observables 

Uncertainty relation (|A.3[) provides the basis for extending to the case of two 
continuous-valued observables X and Y corresponding to POVMs {X x } and {Y y } 
respectively. The procedure is similar to the foregoing. In particular, partitioning the 
range of x into bins Pj of equal size e, one has the corresponding discrete POVM 
A = {Aj} with Aj := j p dxXj. The entropy of the corresponding observable 
A, given a continuous probability density p(x) of X, is then that of the discrete 
distribution pj := J p dxp(x) = ep(x), where the second expression defines the 

piecewise continuous probability density p(x). Substitution into (|A.3|) . with C = Y 

1/2 

and assuming Y y is well-defined, gives 

-me- f dxp(x)lnp(x)+H{Y)> - 2supln\\A 1 / 2 Y^ 2 \\ 0o 
J i,y 

= - supln(V.|V, 1/2 ^V y 1/2 |^) 

= - sup In [ep^ix) (ip\Y y \ip)] , 

j,y,4> 

where \ip y ) :— Yy^^/^Yy^) 1 / 2 and Pip y (x) is defined analogously to p{x) above, 
with respect to the probability density p^ y (x) — (^j y \X x \if! y ). Taking the limit e — >• 
then gives 

H(X) + H(Y) > - sup \n[{iP y \X x \^ y ) (^\Y y ^)\ 

= - sup ln(ip\YV 2 X x YV 2 \>ip) = -2 sup In 11^/2^1/21^^4) 

x,y,ip x,y 

whenever the entropies and the relevant square roots are well defined. 

Indeed, this uncertainty relation can also be applied in some instances when the 
square roots are not well defined, via taking appropriate limits. For example, for 
conjugate position and momentum observables Q and P, with cigenkets |g) and \p) 
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respectively, and any e > 0, define the 'averaged' momentum observable P with POVM 
{P p } via P p := (Ze)- 1 j e _ e dk\p + k)(p + k\. Then, Pp /2 = {2e) 1 / 2 P p from the spectral 
theorem, yielding 

{i>\Pl /2 Q q Pl /2 \i>) = |(<Z|P P 1/2 |V')| 2 = Y £ £ dk( q \p + k)(p + k\^) 2 

<Y e f dk\{q\p + k)\ 2 J £ dk\(p + m\ 2 , 

where the last line follows from the Schwarz inequality. The first integral evaluates 
to 2e/(27r/i), while the second is never greater than unity for any normalised state ip. 
Hence, substituting X = Q and Y = P into the first inequality of uncertainty relation 
(IA.4I) , and taking the limit e — > 0, yields 

H(Q) + H(P) > In 2irh. (A.5) 

Note that the lower bound is not optimal, although it is close to the optimal bound 
\mreh, saturated by Gaussian pure states [3HJ[37]. However, the same lower bound is 
optimal for the related tight uncertainty relation (|A.10I) below. 

More generally, if the POM {Y y } is a continuous projection- valued measure 

1/2 

corresponding to some Hermitian operator, then although Y y is not well defined, 
an entropic uncertainty relation may be obtained via a similar limiting approach. In 
particular, in such a case Y y Y y ' = S(y — y')Y y , implying that Y y := (2e) _1 dzY y+z 

satisfies Y 1 ! 2 = (2e) 1 / 2 F y . Applying the first inequality in (|A.4[) to X and Y yields 
H(X) + H(Y) > - sup In [ (i> y \X x \^) (VI W> 

x,y,ip L 

with $ y ) :=Y y 1/2 \ip)/{4,\Y y \4,) 1 / 2 = {2e) 1 ' 2 Y y \ij)/^\Y y \i;) 1 l 2 . Noting 
ty\Y y \1>) = (2c)- 1 r <tl>\Y y \1>) < {2e)-\ 



the limit e -> gives H(X) + H(Y) > - sup^^^ ln{ip\Y y X x Y v \ip)/{il)\Yy\'tl>). Finally, 
Y y \ip) is always proportional to some unit eigenket of Y, i.e., Y y \ip) = a\y, d), where d 
is a degeneracy index in some orthogonal expansion Y y = \y, d)(y, d\ of Y y (noting 
such expansions are invariant under unitary transformations of the degeneracy basis), 
and the uncertainty relation 

H(X) + H(Y) > -supln {y\X x \y) (A.6) 

immediately follows, where \y) ranges over all unit eigenkcts of Y y . This generalisation 
of (|A.5|) . holding for any projection- valued measure {Yy}, is of particular relevance to 
generators with continuous spectra (section 2.3.2). 

Appendix A. 3. One rank-1 observable 

When the observable A in relation (|A.1[) is rank 1, i.e., when Aj = \aj)(aj\ for some 
set of (not necessarily normalised) kets {|cij)}, then one has the stronger uncertainty 
relation [26] 

H{A) +H(B) > -In max A 1 / 2 b\ /2 + S[p], (A.7) 

j,k J oo 

where S[p] denotes the von Neumann entropy — tr[plnp] of the density operator p 
generating the statistics of A and B. This recent result, by Coles et al. [26] . 
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generalises an earlier version by Berta et al. for the case of two rank-1 projection 
valued observables A and B [37] ■ 

Using the same methods as in appendix A.l, this relation can be similarly 
generalised to infinite Hilbert spaces and one continuous observable, to give 

H{A) + H{C) > - sup \n{ij;\A) /2 CeA) /2 \^) + S[p] (A.8) 

j,e,1> 

whenever the left hand side exists, for any discrete- valued rank-1 observable A and 
any continuously-valued observable C. 

Unfortunately, one cannot analogously generalise (|A.4[) via the methods of 
appendix A. 2, as these methods rely on use of an observable A which is not rank 
1. However, it is conjectured here that such a generalisation exists, with 

H(X) + H(Y) > - sup H^' 2 X X Y^ 2 \^) + S[p] (A.9) 

for any two continuously valued observables X and Y , providing X is rank 1, both 
entropies exist, and Yy^ 2 is well defined. 

The above conjecture can be proved for the special case of conjugate position 
and momentum observables, using an approach of Pegg et al in which Q and P are 
represented by approximating them as discrete rank-1 observables on a D-dimcnsional 
Hilbert space and taking the limit D — > oo [38j . In particular, substituting the discrete 
observables into (|A.7[) and taking this limit yields 

H(Q) + H(P) - S[p] > In 27rfi, (A.10) 

whenever the left hand side exists. The same method generalises to the case of 
conjugate n- vectors Q and P, with the right hand side of the above relation being 
multiplied by n. This result proves the conjecture in equation (47) of |35j . which 
was made on the basis of a semiclassical argument. Note that, in contrast to 
inequality (|A.5j) . the bound in inequality (|A.10j) is tight, being saturated in the limit 
of equilibrium states in the high temperature limit [35] . 



Appendix A. 4- Covariant observables 

The entropic uncertainty relation 

H(G) + H(M) > hiTre, (A.ll) 

will be obtained here, for any observable M covariant with respect to a continuous 
Hermitian observable G, and also a generalisation to estimates based on m repeated 
measurements, as required for equation (|16[) of the text. 

First, let F denote the observable canonically conjugate to G, with POVM {F x } 
given by H [33 

F x := e~^F^ G \ F Q := (2^ £ / dgdg' \g,d){g',d\, 

d 

where the projection T g has the orthogonal expansion T g = ^2 d \g,d){g,d\. Now, 
any density operator p may be formally mapped to a density operator p* of a 
1-dimensional particle, with position coordinate Q and eigenkets \q), via p* := 
J2dl dqdq'\q)(q'\ (q,d\p\q' ,d), (where one takes \g, d) := for g outside the spectrum 
of G) . By construction, the probability distributions of F and G for p are identical to 
the probability distributions of Q and P for p* , where P is the momentum observable 
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conjugate to Q in units such that H = 1. From the known entropic uncertainty relation 
for Q and P [3H [37J , it immediately follows that 

H{F) + H(G) = H(Q) + H{P) > hiTre. 

Further, the probability distribution of any covariant observable M for some state 
p is equal to the probability distribution of the conjugate observable F for some 
corresponding state p', where p and p' have the same probability distribution for the 
observable G [35]. Hence the above bound also holds with F replaced by M, yielding 
(|A.11[) as desired. 

It is of interest to consider a class of covariant estimates m based on independent 
measurements of the covariant observable M, made on m respective copies of the 
probe state. Let x = f(xi, x 2 , ■ ■ ■ , x m ) denote the corresponding estimate of the shift 
parameter, where Xj denotes the individual estimate given by the j-th measurement. 
It will be assumed that the estimate is shift-invariant, i.e., that the function / satisfies 
the identity 

f(x x + y,x 2 +y,.. -,x m + y) = f(xx,x 2 , ■ ■ -,x m ) + y. (A. 12) 

This is satisfied, for example, by any weighted mean / = w\Xi + ...w m x m with 
u>i + . . . + w m = 1. Differentiating with respect to y and taking y — gives the 
equivalent condition ^2j(df/dxj) = 1. 

To show that such an estimate is covariant with respect to the corresponding 
shift generator Gt = G\ + . . . + G m , let x denote the vector (xi, . . . , x m ), and define 
S x := {x : /(x) = x}. The POVM {M~} corresponding to the joint estimate is then 
given by M~ = J s dxMz 1 (8 . . . ® M± m and therefore 

e -iG-TX M J e iG TX = I dy e~ lGlX M yi e lGlX ® . . . <g> e~ lG ™ x M Vm e lG ™ x 

JSy 

= dy M x+Vi (g) . . . ® M x+Vm 

JSy 




dy M yi ® . . . (8 M Vm = M J X 



where the last line follows via the change of variables y m — > y m — x and using the 
above identity for /. Hence M J is covariant with respect to Gt- 

Appendix B. Entropy bounds for fixed (\G — g\) 

Appendix B.l. Discrete generators 

First, consider the special case G where the spectrum of G is some subset of the 
integers. Now, the maximum entropy of any distribution over the integers for a fixed 
value of (| n — no |) corresponds to maximising the variational quantity 

J = - ^Pn lnp„ + a^p n + /3 ^ \n - 7l |p„, 

n n n 

where a and /3 are Lagrange multipliers. It is convenient to work with the displaced 
distribution q n := p„ + „ , for which the variational equation 8J/Sq n = has the 
solution q n = Av^ , for suitable positive constants A and v determined by the 
contraints. One easily finds that A = (1 — v)/(l + v) and n := (|n — no|) = J2 n \n\q n = 
2v/{l — v 2 ). Further, H max (G) = — J2 n 1n[lnA + \n\ lnu] = — ln^4 — nlnv. Inverting 
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the relation between n and v gives v = (1 + l/n 2 ) 1 ^ 2 — l/n = e x , where sinhx = 1/n, 
and the maximum entropy simplifies to 



#max(G)=ln (Vn 2 + l + n + x/ sinhar < ln[2n + 1] + 1, (B.l) 

where the final inequality follows by adding a term 2n under the square root and noting 
that x < sinhx (e.g., from the Taylor series expansion of sinhx). This corresponds to 
the bound §§§ with A = 1 = c. Further, if the spectrum of G is bounded below by 
n m j n , a similar calculation for the choice no = ?i m ; n leads to ff m ax(G) < In (n + 1) + 1 
[19], i.e., to replacement of the factor of 2 in (jB.ll) by unity. 

More generally, let G have an arbitrary discrete spectrum, and let {g n } denote 
the distinct eigenvalues of G in increasing order. Thus, the minimum spectral gap is 
given by A = minj^+i — gj). Hence, if n > tiq then 

\9n - 9n„ \ = (g n - 9n-l) + ■■■ + (fjn„ + l - 9n ) >\n- n |A, 

and the same relation also follows for n < hq, implying for any probability distribution 
{p n } of G that 

(\ G - fifnol) = ^2 \ gn ~ 3n \Pn >^\n- n \Ap n = uA, 

n n 

where ri := (\n — no\) as before. Since inequality (|B.1[) bounds the maximum possible 
entropy of any discrete distribution {p n } for a fixed value of n, it follows immediately 
that the entropy of G is bounded by 

H{G) < \n[2n + 1] + 1 < In [2A _1 (|G - g no \) + l] + 1. 

This establishes the bound (j9|) for an arbitrary discrete spectrum. Again, if the 
spectrum of G is bounded below by (? m i n , the factor of 2 may be replaced by unity for 
the choice g no = g n 



J mm • 



Appendix B.2. Continuous generators 

If G has a continuous spectrum, then a bound on the maximum entropy of G, under 
the constraint of a fixed value of (\G — g'\), is obtained by maximising the variational 
quantity 

J = - / dgp{g)\np(g)+a I dg p(g) + (3 / dg\g~g'\p(g) 



over all probability distributions on the real line. It is convenient to work with 
the displaced distribution q(g) := p(g + g'), having the same entropy as p(g), for 
which the variational equation SJ/Sq = yields q(g) = Ae^^ 9 ^. The constants A 
and f3 are determined by the constraints to be A = 1/(27?) and j3 = 1/g, where 
g := (\G — g'\) = J dg\g\q(g), allowing the maximum possible entropy to be calculated 
as 

ffmax(G) = - Jdgq(g)[lnA - p\g\] = - In A + pg = \n[2g] + 1, 

establishing the bound (jH]) for an arbitrary continuous spectrum. If the spectrum of 
G is bounded below by <7 m in, a similar calculation for the choice g' = g m i n leads to 
Hmax(G) < ln(<7 + 1) + 1, i.e., to replacement of the factor of 2 by unity. 
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